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Formation and dissociation mechanisms of C-C* base pairs in acidic and alkaline environments are investigated, 
employing ab initio quantum chemical calculations. Our calculations suggest that, in an acidic environment, a cytosine 
monomer is first protonated and then dimerized with an unprotonated cytosine monomer to form a C-C* base pair; in an 
alkaline environment, a protonated cytosine dimer is first unprotonated and then dissociated into two cytosine monomers. 
In addition, the force for detaching a C-C* base pair was found to be inversely proportional to the distance between the two 
cytosine monomers. These results provide a microscopic mechanism to qualitatively explain the experimentally observed 


reversible formation and dissociation of i-motifs. 
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1. Introduction 


I-motif is a cytosine-rich DNA oligomer with a tetrameric 
structure formed in an acidic environment. It is a four-stranded 
complex consisting of two base-paired parallel-stranded du- 
plexes, each of which is held together by protonated cytosine— 
cytosine (C-C*) base pairs. In an acidic environment, i-motif, 
a cytosine-rich DNA oligomer, always adopts a four-stranded 
structure with two associated and fully intercalated parallel- 
stranded duplexes, as has been shown by NMR,!!-?! X-ray 
diffraction analysis“! and circular dichroism studies.'*! The 
building blocks of an i-motif are base pairs stacked face- 
to-face with each other."! An i-motif unfolds and dissoci- 
ates in an alkaline environment but refolds if the environment 
changes back to acidic.!'°! Therefore, the i-motif can be used 
as a molecular machine, switching between unfolded random 
strings and folded i-motif structures.''-!3! Understanding the 
reversible folding and unfolding mechanisms of the i-motif is 
essential for its applications. 

Due to the lack of experimental data on cytosine-base 
interactions, the nature of i-motif formation and dissocia- 
tion mechanisms remains obscure. Gas-phase experiments 
can reliably reveal complex characteristics, !!*:!>! but these ex- 
periments are very difficult even for H-bonded DNA base 
pairs. Observation of protonated cytosine—-cytosine (C-C*Ħ) 
base pairs outside the i-motif is challenging, so only gas-phase 
data exist for the base pairs of cytosine derivatives."'©! Other 
methods, such as base association in polar and nonpolar sol- 
vents, on electrodes and with X-rays, provide only indirect in- 
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formation about the cytosine-base interactions. !!7] 


In this work, we use ab initio quantum chemical 
calculations!'*! to investigate the formation and dissociation 
mechanisms of a C-C* base pair. Our calculations suggest 
that, in an acidic environment, a cytosine monomer is first 
protonated and then dimerized with an unprotonated cytosine 
monomer to form a C—C* base pair; in an alkaline environ- 
ment, a C-C* base pair is first unprotonated and then dis- 
sociated into two cytosine monomers. Our ab initio calcu- 
lations on C—C* base pairs provide useful information for un- 
derstanding the molecular origin of the formation and dissoci- 
ation of i-motif; our results can be systematically refined when 
much larger computational resources are available. 


2. Computational method 


To obtain energy differences between configurations, nine 
molecular structures were set up and run through ab initio 
calculations: (i) two cytosine monomers; (ii) a C-C* base 
pair; (iii) two protonated cytosine monomers; (iv) an unpro- 
tonated cytosine—cytosine dimer; (v) a water molecule; (vi) a 
hydronium ion; (vii) a hydroxyl ion; (viii) a C-C* base pair 
with the hydrogen ion located at different places between the 
two 3’ end nitrogen atoms; and (vx), a hemiprotonated cy- 
tosine dimer with different distances between the two 3’ end 
nitrogen atoms. The cytosine monomer and dimer configura- 
tions were taken from the NMR structure of the i-motif (PDB 
ID:1YBN).!!°! The initially constructed molecular structures 
were optimized by the ab initio calculations at the HF/6- 
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311g** level; the total energies of the optimized structures 
were calculated at both the HF/6-311g** and MP2/6-311g** 
levels, plus another optimization and energy calculation pro- 
cedure at the B3LYP/6-311g** level was made for comparison. 
All ab initio calculations were performed using the Gaussian 


software. [8] 


3. Results and discussion 
3.1. Optimized C-C* base pair configuration 


According to its NMR structure shown in Fig. l(a), i- 
motif is are constructed by the stacking of C-C* base pairs. 
However, since the hydrogen atoms cannot be identified by 
the NMR scanning, the exact location of the proton in the 
C-C™ base pair is unknown. Therefore, we added a proton 
between the 3’ end nitrogen atoms; the optimized dimer struc- 
ture is shown in Fig. 1(b). In the optimized structure, the 


distance between the two 3’ end nitrogen atoms is 2.89 A and 


Fig. 1. (color online) (a) Cytosine dimer configuration taken from the i- 
motif NMR structure without the central hydrogen atom. (b) Optimized 
hemiprotonated cytosine dimer configuration with the central hydrogen 
atom. 
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Fig. 2. (color online) Energy landscape of the hemiprotonated cytosine 


dimer with the hydrogen ion moving along the line between the two 3’ 
end nitrogen atoms. 


the hydrogen ion, lined up with the two nitrogen atoms, is 
1.02 A away from one 3’ end nitrogen atom and 1.87 A away 
from the other one. The energy landscape obtained by mov- 
ing the proton along the line between the two nitrogen atoms, 
as shown in Fig. 2, is a double well with an energy barrier of 
about 20 kJ/mol calculated at the HF and B3LYP levels (about 
45 kJ/mol at the MP2 level). These results indicate that the 
hydrogen atom prefers to stay close and chemically bonded to 
either of the two 3’ end nitrogen atoms, which is consistent 
with the NMR results. 7° 


3.2. Formation and dissociation of C-C* base pair 


The formation and dissociation of C-C* base pairs are 
crucial for understanding the folding and unfolding mecha- 
nisms of the i-motif, as C-C* base pairs are the building 
blocks of an i-motif. In this work, we propose a reversible 
chemical-reaction mechanism for the formation and dissocia- 
tion of a C-C* base pair based on our ab initio calculations. 
The postulated chemical reaction paths are shown in Fig. 3. 

Two possible C-C* base-pair formation paths in an acidic 
environment are illustrated in Fig. 3(a). Since hydronium ions 
are abundant in an acidic environment and easily lose hydro- 
gen atoms, it is natural to regard the formation of C-CT base 
pairs from two unprotonated cytosine monomers and one hy- 
dronium ion. In the first proposed reaction path, the two cy- 
tosine monomers first attract each other to form an unpro- 
tonated cytosine dimer and then one hydrogen atom is de- 
tached from a hydronium ion and attached to one of the 3’ 
end nitrogen atoms; in the second reaction path, one hydro- 
gen atom is detached from a hydronium ion and attached to 
one of the cytosine monomers before the protonated cyto- 
sine and the unprotonated cytosine attract each other to form 
a C-Ct base pair. The configurations with different com- 
binations of those molecules went through ab initio calcula- 
tions; the obtained reaction enthalpies for various steps are 
listed in Table 1. 
enthalpy for a cytosine monomer and a hydronium to form 


As shown in Table 1, the total reaction 


a C-C* dimer and a water molecule, calculated by the MP2 
method, is about —747.121 kJ/mol (—742.696 kJ/mol by HF 
and —743.172 kJ/mol by B3LYP), which suggests that, in an 
acidic environment, it is energetically favorable for two cyto- 
sine monomers to snatch a proton from hydronium and form 
a C-Ct base pair. Moreover, the second reaction path men- 
tioned above is energetically more favorable than the first one, 
since it costs energy for two cytosine monomers to form an 
unprotonated cytosine dimer (AH? ), while it releases energy 
for a cytosine monomer to snatch a proton from hydronium 
(AH3, ) before another energy-releasing process takes place to 
form the C-C* base pair. This is consistent with the experi- 
mental results," showing that i-motifs only fold in an acidic 
environment but unfold even in a weak alkaline environment. 
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Fig. 3. (color online) (a) Two possible chemical reaction paths for the formation of a C-CT base pair. The reactants are on the top. (b) 
Two possible chemical reaction paths for a C-C* base pair to dissociate. The reactants are at the bottom. 


Table 1. Calculated reaction enthalpies for the reaction paths in Fig. 3. 


Reaction enthalpy HF/(kJ/mol) MP2/(kJ/mol) B3LYP/(kJ/mol) 
AH? 82.229 58.512 85.354 
AH}, -824.925 -805.634 -828.526 
AH3 -546.274 -524.526 -538.595 
AH3, -196.421 -222.596 -204.577 
AH* -742.696 -747.121 -743.172 
AR? 82.229 58.512 85.354 
AH?, 231.670 266.593 230.715 
AH} 510.321 547.701 520.646 
AH}, -196.421 -222.596 -204.577 
AH? 313.899 325.106 316.069 


In contrast, two possible C-Ct base-pair dissociation 
paths in an alkaline environment are illustrated in Fig. 3(b). 
Hydroxyls are considered to participate in the dissociation re- 
action since they are abundant in an alkaline environment. In 
the first proposed reaction path, the C-C* base pair first gives 
a proton to the hydroxyl before the unprotonated cytosine 
dimer dissociates into two unprotonated cytosine monomers; 
in the second reaction path, the C-C* base pair first disso- 
ciates into one protonated cytosine monomer and one unpro- 
tonated cytosine monomer and then the protonated cytosine 
monomer gives out a proton to the hydroxyl. Our calcula- 
tions indicate that the total reaction enthalpy for a C-C* base 
pair to give up a proton to the hydroxyl and dissociate into 
two cytosine monomers is negative (—325.106 kJ/mol with 
MP2, —313.899 kJ/mol with HF and —316.069 kJ/mol with 
B3LYP), suggesting that it is energetically favorable for a C— 
C* base pair to dissociate in an alkaline environment. Accord- 


ing to the data listed in Table 1, in the alkaline environment, a 
cytosine protonated cytosine dimer dissociates by giving up a 
proton to form an unprotonated cytosine dimer before dissoci- 
ating into two cytosine monomers, which is the first proposed 
reaction path mentioned above. This result is consistent with 
the existing experimental results. ?!! From the energetic point 
of view, the energy released from the combination of proton 
and hydroxy] is larger than the energy for a C-C* base pair to 
give up a proton; however, the energy released from the com- 
bination of a proton and a water molecule is much smaller. 
This ensures that the dissociation process can take place only 
in an alkaline environment. 


4. Dissociation energy and force for detaching a 
C-C* base pair in a vacuum 


Since the i-motif can be used as a molecular machine re- 
versibly switching between unfolded random strings and the 
folded i-motif structures,!!!-!3] the energy and force needed 
to detach a C-C™ base pair is an essential topic of interest. 
Therefore, we varied the distance between the two 3’ end ni- 
trogen atoms and the energy and force were calculated for 
each distance. The consequent energy and force landscapes 
are shown in Fig. 4. As the distance increases, the energy 
monotonically increases but the increasing speed slows down 
(Fig. 4(a)). Correspondingly, the force decreases drastically 
with the distance (Fig. 4(b)) and an inversely proportional 
relationship was found between the force and the distance 


(Fig. 4(c)). 
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Fig. 4. (a) Energy landscape for detaching a C-C* base pair along the 
line between the two 3’ end nitrogen atoms. (b) The force landscape 
corresponding to panel (a). (c) Same plot as panel (b), but with the 
X-axis representing the inverse of the distance. 


5. Conclusions 


In summary, we have performed ab initio quantum chem- 
ical calculations to investigate the C-C* base pair formation 
and dissociation mechanisms in acidic and alkaline environ- 
ments, respectively, which is essential for understanding the 
transformation of an i-motif under different chemical condi- 
tions. Our calculations suggest that the hydrogen ion plays an 
important role in the formation and dissociation of C-CT base 
pair. In an acidic environment, two cytosine monomers cannot 
form a dimer without a proton. One cytosine monomer has to 
be protonated by snatching a proton from a hydronium before 
it can combine with the other cytosine to form a C—C™ base 


pair. In contrast, in an alkaline environment, a C-C™ base 
pair gives up a proton to a hydroxyl and becomes an unpro- 
tonated cytosine dimer before it dissociates into two cytosine 
monomers. Our calculations also demonstrate that the force 
for detaching two cytosine monomers in the C-C* base pair 
in a vacuum is inversely proportional to the distance between 
them. 


It should be noted that our calculations assume greatly 


simplified conditions compared with experiments. Interac- 


tions between base pairs may further stabilize the i-motif but 
the environment, the configurational flexibility and thermal 
fluctuations may greatly reduce the reaction enthalpies. Nev- 
ertheless, the reaction paths we have determined and the mech- 
anisms we have proposed provide an important qualitative un- 
derstanding of the formation and dissociation of the i-motif 
and form the basis for future, more sophisticated theoretical 
studies of i-motif folding and unfolding mechanisms. 
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